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Abstract 

We analyze the asymptotic behavior of compute-and-forward relay networks in the regime of high signal-to- 
noise ratios. We consider a section of such a network consisting of K transmitters and K relays. The aim of the 
relays is to reliably decode an invertible function of the messages sent by the transmitters. An upper bound on 
the capacity of this system can be obtained by allowing full cooperation among the transmitters and among the 
relays, transforming the network into a K x K multiple-input multiple-output (MIMO) channel. The number of 
degrees of freedom of compute-and-forward is hence at most K. In this paper, we analyze the degrees of freedom 
, achieved by the lattice coding implementation of compute-and-forward proposed recently by Nazer and Gastpar 

■ We show that this lattice implementation achieves at most 2/(1 + l/K) < 2 degrees of freedom, thus exhibiting a 
^ f very different asymptotic behavior than the MIMO upper bound. This raises the question if this gap of the lattice 

■ implementation to the MIMO upper bound is inherent to compute-and-forward in general. We answer this question 
, in the negative by proposing a novel compute-and-forward implementation achieving K degrees of freedom. 

I. Introduction 

The two central problems of reliable communication over a wireless relay network are the signal 
^ ■ interactions introduced by the wireless medium and the additive noise experienced at the nodes in the 
I — \. network. Traditional approaches of dealing with these problems fall broadly into two categories. On the 
[ one hand, intermediate relays in the network can try to completely remove the receiver noise. The decode- 
> ; and-forward scheme (see [[Il-[|3l, among others) falls into this category. While this solves the problem of 
^ ■ noisy reception, its performance is adversely affected by the signal interactions, which are usually avoided 
^ \ by careful scheduling of transmissions. On the other hand, intermediate relays can try to make use of 
CN [ the signal interactions introduced by the channel either by not removing the additive noise at all, or by 
^ ■ only removing it partially. Schemes such as amplify -and-forward (see, e.g., jll, [ffl-jll) or compress-and- 
Q ■ forward (see, e.g., [[D, [|3l, ir7]|- [[T0ll ) fall into this category. Since noise is not or only partially removed 
at the relays, these schemes suffer from noise accumulation. 

A third approach, referred to as either compute-and-forward [fTTI . [fT2]| . physical-layer network coding 
[fT3l . [[T4l . or analog network coding [[TSl . aims to both harness the signal interactions introduced by the 
channel and remove the noise at the relays. This is achieved by allowing the relays to decode noiseless 
\ functions of the transmitted messages. At the destination node all the information streams are combined 
to determine the original messages being sent. In this paper, we examine the design and performance of 
such schemes. 

A small example illustrates the approach, see Fig. [T] Consider a section of a larger relay network 
with 2 transmitters and 2 relays. The channel gains {hm,k) between the transmitters and the relays are 
assumed to be constant and known throughout the network. The transmitters have access independent 
messages Wi,W2, which are separately encoded, modulated, and then sent over the channel. The relays 
receive a linear combination of these transmitted signals corrupted by additive noise. Each relay decodes 
independently; however, the receivers do not aim to decode the original messages wi,W2. Rather, each 
relay m decodes an intermediate quantity Um, which is a noiseless function of the messages wi and W2. 
Crucially, these functions are chosen to be adapted to the channel gains. In other words, the computation of 
the functions ui, U2 is aided by the signal interactions introduced by the channel. Following the decoding 
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Stage, the decoded functions ui,U2 are combined and inverted to recover the original messages Wi,W2- 
This combining and inverting of the decoded functions is to be interpreted as taking place at the destination 
node (not explicitly modeled in this scenario), which is interested only in the original messages. 
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Fig. 1. A section of a relay networlc witii two transmitters and two relays. 

In IfTTTl . Nazer and Gastpar propose an ingenious coding scheme for compute-and-forward using lattice 
codes (see lfT6l . ifTTl . among others) at each transmitter. These lattice codes have the property that any 
integer linear combination of two codewords is again a codeword. Due to the additive nature of the 
channel, each relay receives a linear combination of the lattice codewords (which is again a codeword) 
plus some additive noise. The relays then decode the linear combination of the codewords, removing the 
noise. The relays thus decode a noiseless function of the messages. In terms of our example with two 
sources and relays, we see that the decoded quantities ui,U2 are linear functions of the messages wi,W2 
in this case. Assuming the resulting system of linear equations is invertible, the original messages wi and 
W2 can be recovered at the final destination from ui and U2. 

However, there is a subtle difficulty with this approach that we have neglected in the above description. 
The lattice property of the codes ensures only that any integer linear combination of codewords is again 
a codeword, whereas the linear combination computed by the wireless channel can have arbitrary real 
(or complex) coefficients. To overcome this difficulty, [ITTIl proposes to scale the received channel output 
so that the scaled received linear combination of codewords is close to an integer linear combination. In 
general, the larger the scaling factor the better the approximation, increasing achievable rates. At the same 
time, a larger scaling factor results in amplification of noise, decreasing achievable rates. 

We hence see that there is a tradeoff between closeness of approximation and noise amplification. This 
tradeoff is a central theme in the field of Diophantine approximation, which studies the approximation of 
real numbers by rationals, and we will refer to this as the Diophantine tradeoff in compute-and-forward. 
The rates achievable by the lattice coding implementation of compute-and-forward in [[TTll are not given 
by an analytic expression, but rather as the solution to an optimization problem, in which this tradeoff 
appears implicitly. It is hence not clear how significant the loss due to this Diophantine tradeoff is. 

In this paper, we show that the loss in rate due to this tradeoff is indeed significant at high but still 
realistic values of signal-to-noise ratio (SNR), say 20dB and above. In particular, for the two-user example 
discussed earlier, we show that due to this Diophantine tradeoff the compute-and-forward scheme in IfTTl 
achieves only one degree of freedom (capacity pre-log factor), the same as time sharing between the 
transmitters. In other words, in the two-user case, the compute-and-forward implementation in [[TTl and 
time sharing have the same high-SNR behavior. For the general case with K transmitters and K relays, 
we show that the lattice scheme achieves at most 2/(1 + 1/A') < 2 degrees of freedom. While potentially 
better than time sharing, this is considerably worse than the MIMO upper bound of K degrees of freedom 
that would be achievable with full cooperation among the transmitters and among the relays. 

This negative result raises the question as to whether this Diophantine tradeoff and the associated 
loss are inherent to compute-and-forward as a scheme in general or whether they are an artifact of the 
implementation in IfTTl . We show that the latter is the case and that compute-and-forward in general does 
not suffer from this tradeoff. To this end, we propose a novel implementation of compute-and-forward that 
achieves K degrees of freedom, matching the MIMO upper bound. Thus, compute-and-forward can achieve 
the same asymptotic rates as if cooperation among the transmitters and among the relays were allowed. 
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The proposed achievable scheme introduces the concept of signal alignment, related to the alignment of 
interference. This alignment of signals is crucial to achieve the K degrees of freedom upper bound, and 
indicates that the compute-and-forward problem and the interference channel problem are closely related. 

The remainder of this paper is organized as follows. Section |II] provides a general formulation of the 
compute-and-forward setting. Section [ni] states the main results. Proofs are presented in Sections HVl - IVIl 
Section Ivn] contains concluding remarks. 

IL Problem Statement and Notation 

A. Notational Conventions 

Throughout this paper, we use the following notational conventions. Vectors and matrices are written in 
bold font in lower and upper case, respectively, e.g., h and H. For a matrix H, its transpose is denoted 
by , and its determinant by det{H). For a vector h, we write \\h\\ for its Euclidean norm. We denote 
Lebesgue measure by /i. We say that a property holds for almost every H if the set B of H for which 
the property does not hold has Lebesgue measure fi{B) equal to zero. Finally, all logarithms are to the 
base 2, and therefore channel capacities are expressed in bits per channel use. 

B. Problem Statement 

We consider a section of a relay network with K transmitters and K relays modeled by a discrete-time 
real Gaussian channelQ The channel output ym[t] at receiver m G {1, . . . , K} and time t G IN is 

K 

ym[t] = ^ hm,kXk[t] + Zm[t]. (1) 
k=l 

Here G R is the channel input at transmitter k E {1, . . . , K}, hm,k G R. is the channel gain between 
transmitter k and receiver m, and Zm[t\ G R. is additive white Gaussian noise with zero mean and unit 
variance. Note that the channel gains {hm,k) are deterministic and constant across time. As such, they are 
known throughout the network. To simplify notation, let the row vector 

be the channel gains to receiver m, and set 

/h^\ 
ho 

\hK/ 

Transmitter k has access to an independent message Wk uniformly distributed over {0, 1, ... , Wk — !}• 
The goal of receiver m is to compute the (deterministic) function 

Um = amiwi,W2, • • • , Wk) G {0, 1 , . . . , t/^ - 1 } . 

Since is a deterministic function, its range can contain at most Um < 11^=1 elements. We impose 
that the messages (wk) can be recovered from the decoded equations (um), i.e., that the vector map 
induced by the K functions (am) is invertible. 

Formally, a block code of length T and power constraint P consists of K encoders 

h: {0,...,iyfc-l}^R^ 



'Throughout this paper, we assume real channels. Using arguments similar to the ones in 1181 . 1191 . the results can be extended to hold 
for complex channels as well. 
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for A; G {1, ... , K}, mapping the message Wk to channel inputs 

such that 

and K decoders 

0^: R^^{0,l,...,f/„-l} 
for m G {1, . . . , K}, mapping the channel outputs iym[t])J=i to the estimate 

of Um- The probability of error of this block code is 

Observe that the probability of error is defined with respect to the equations iim and not the original 
messages Wk- The (sum) rate of this block code is 

1 ^ 

k=l 

A rate R{H,P, (am)) is achievable if for every ?7 > there exists a block code of length T and power 
constraint P with probability of error less than rj and rate at least R{H,P,{am))- The computation 
capacity for functions (a-m), denoted by C{H,P, (am)), is defined as the supremum of achievable rates. 
Finally, define the computation capacity 

C(/f,P) ^ sup C(/f,P,(a„,)), 

(Um) 

where the supremum is over all invertible (deterministic) functions (0^)- 

Note that in this definition of computation capacity, it is irrelevant which functions the receivers decode, 
as long as all the decoded equations allow recovery of the original messages. This requirement is best 
understood in the context of a larger relay network, in which the channel considered here is only one 
component of the network, and the receivers here correspond to intermediate relays. The invertibility of the 
map (om) guarantees that collectively the decoded equations (um) at these relays contain all the information 
about the messages (wk) at the transmitters. However, the decoded equations have to be deterministic, 
i.e., all noise introduced by the channel has to be removed at the relays. This ensures that noise is not 
forwarded further down the larger relay network. These two requirements (invertibility and noise removal) 
are the essence of the compute-and-forward approach. We point out that decode-and-forward is a special 
case of the above definition in which the function are given by 

for all m. On the other hand, schemes like amplify-and-forward or compress-and-forward do not satisfy 
the above definition, since they compute randomized (i.e., noisy) functions of the messages. 

While the above definition of computation capacity allows for arbitrary functions it is worth 
mentioning the special case of linear functions. In this case, receiver m aims to compute the function 

K 

«m = ^ a'm,kU!k, 
k=l 
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with a™ k G Define the row vector 

— (Om,l 0'm,2 ' ' ' 0'm,K) 



and the corresponding matrix 



A 



The messages {wk) can in this case be recovered from the decoded equations {um) if the matrix A is 
full rank. With slight abuse of notation, we write C{H, P, A) for the computation capacity for the linear 
function determined by the coefficient matrix A. 

In the remainder of his paper, we will be interested in the degrees of freedom of the computation 
capacity C{H,P) defined as 

lim?i^ 
P^- ilog(P) 

assuming the limit exists. If this limit is equal to D, then 

C{H,P) = ^\og{P) + o{\og{P)) 

as P — >■ oo. Thus, the degrees of freedom describe the behavior of C{H,P) at high SNR. Since the 
o(log(P)) approximation alone can be quite weak, we will provide tighter second-order asymptotics as 
well. 



III. Main Results 

Nazer and Gastpar [fTTl Theorems 1 and 2] provide an achievable scheme based on lattice codes for 
computation of linear equations over the channel ([T]), showing that, for A E Z^^^, 

C{H,P,A) > Ri{H,P,A) 

K 

Emin R\_{hm, P, am) 



K 



= min ( ^log(l + P\\hmf) - ^log( ||a„f + pf ||h„,f ||a„f - (/i„aD^) ) 

(2) 

is achievable. We emphasize that ([2]) is only valid for integer matrices A G Z^^^. This restriction turns 
out to be a significant limitation, as we will see later. 

Let us interpret the terms in the definition of Ri{hm,P,am)- The first term corresponds to the sum 
capacity of a multiple-access channel with channel gains hm- The second term represents the rate loss 
incurred by using the coefficients a^. This rate loss, governed by 

Wamf + P(\\h. 



(ll^mf llOmf - (^malj^), (3) 



consists of two parts: the squared norm of a„, and the power P times the gap arising from the Cauchy- 
Schwarz inequality, which is therefore nonnegative. This second term is zero if and only if a^ and hm 

^This setting can be slightly generalized by considering a vector of messages Wk (instead of a scalar Wk) and computing Uk by applying 
the same linear function to every component of w^- The distinction between the scalar and vector cases is immaterial for the purpose of 
this paper, and we will refer to both as linear computation. 
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are coUinear. Recall that has integer components and can therefore not be chosen to be coUinear to 
hm in general. Denote by 

RdH,P)= max RdH,P,A) 

the largest rate achievable with the lattice scheme proposed in [fTTl FI 

As mentioned earlier, the scheme by Nazer and Gastpar uses lattice codes, which have the property that 
every integer linear combination of two codewords is again a codeword. With this approach, the receivers 
directly decode the linear combinations (um) and never explicitly decode the messages (wk). A different 
approach would be to choose A = / so that Um = Wm. for all m. This can be implemented by time 
sharing between all the transmitters, achieving a sum rate of at least 



C{H,P)>C{H,PJ) 
^ 1 



fc=i 

For A = I, the problem actually reduces to the standard interference channel, for which interference 
alignment achieves 

C{H,P)>C{H,PJ) 

>^log(P)-o(log(P)) (5) 

as P — J- oo for almost every channel matrix H ll20l . As P — oo, this rate is the best achievable for 
A = I and almost every H, as it is shown in [l2T| that 

C{H, P,I)<^ log(P) + o(log(P)). (6) 

Finally, by allowing cooperation among the transmitters and among the receivers, the computation rate 
can be upper bounded by the capacity of the MIMO channel with the same channel matrix H. This can 
be further upper bounded by relaxing the per-antenna power constraint to a sum power constraint, so that, 
by m, 

C{H, P) < max ^ log det (/ + HQH^) , (7) 

where the maximization is over all covariance matrices Q with trace at most KP. 

To compare the upper bound (|7]) to the lower bounds ©, dH), and ©, it is insightful to consider their 
asymptotic behavior as power P grows. The time-sharing lower bound (31) yields 

P^<^ |log(P) ~ 

i.e., time sharing achieves one degree of freedom. The interference-alignment lower bound Q yields 

liminf ?^:^,^ >K/2 
ilog(P) - / 

for almost every channel matrix H, i.e., interference alignment achieves K/2 degrees of freedom. On 
the other hand, almost every channel matrix H has full rank, in which case the MIMO upper bound Q 
yields 

C(H P) 

limsup , <i^, (8) 

^By II II Lemma 1], a maximizing A G 
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i.e., the corresponding MIMO channel has K degrees of freedom. Thus, at high SNRs, time sharing and 
interference alignment behave very differently from the MIMO upper bound for almost every H. Observe 
that by Q any scheme using decode-and-forward, i.e., with coefficient matrix A = I, achieves at most 
K/2 degrees of freedom for almost every H. Hence, if we are to attain the upper bound of K on the 
degrees of freedom, the use of general compute-and-forward (as opposed to simple decode-and-forward) 
will be necessary. 

The behavior of the rate -Rl achieved by the lattice scheme is more difficult to evaluate. If H E Z^^^ 
has integer components and is invertible, we can set A = i? in Q to obtain 

Rt{hm,P,hm) = ^log(l + P||/i^f) -ilog(llh^f). 

Hence, in this case, 

liminf ^^^^;^^ >i^-. 

P->oo i log(P) 

More generally, if H E Q^^^ has rational components and is invertible, then there exists a g G IN such 
that qH E Z^^^. Setting A = qH in Q yields that lattice coding achieves a rate of 

1 1 
Ri{hm,P,qhm) = -log(l + P||/l„^f) - - log (g^ h„ f ) , 



and again 



liminf5t(^>i^. 



P^oo 1 log(P) 



KxK 



Since R\_ < C, we obtain together with the MIMO upper bound (|7]) that for invertible G Q 

4t(Z^ = K. 

llog(P) 

In other words, for invertible H with rational components, the scheme based on lattice coding is 
asymptotically optimal. In particular, this implies that the lattice scheme significantly outperforms the 
schemes based on time sharing and based on interference alignment. 

However, the requirement of rational channel gains H is quite strong. In fact, this event has Lebesgue 
measure zero. The question arises whether the behavior of the rate R]_ achieved by the lattice scheme of 
ifTTI is significantly altered if we relax this assumption of rational channel gains. The next theorem shows 
that this is indeed the case. In fact, for almost all channel gains, the lattice scheme has an asymptotic 
behavior that is not significantly better than time sharing. 

Theorem 1. For any K > 2 and almost every II E R^^^ there exists a positive constant ci = Ci{K, H) 
such that for all P > 3 

Rl{h, p) < Y^Y/K + logiog(^)- 



In particular, this implies that for any K > 2 and almost every H eR 

lim sup — PI — - — — < 



KxK 



P^J ilog(P) - 1 + 1/K- 
We remark that, for K = 2, Theorem [U can be sharpened to 



p^oo a™eZ2\{0} ^log(P) 

for almost every hm G R^, so that 

lim sup — j ; — — < 1 

ilog(P) - 



r R\_{hjn,P,am) I „ 
limsup max —^ — — ^ £1/^; (") 
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for almost every H G R^^^. 

Theorem [T] shows that for almost every channel matrix H there is only limited asymptotic gain over 
time sharing by using the lattice scheme in [ITTIl . In particular, for K = 2, time sharing and lattice coding 
achieve the same degrees of freedom. For large K, the upper bound in Theorem [T] is approximately 2 — 
better than time sharing, but still far off from the K/2 degrees of freedoms achievable with interference 
alignment and the MIMO upper bound of K degrees of freedom. In other words, it seems to suggest 
that, at high SNR, compute-and-forward offers only limited advantage over standard coding schemes. This 
conclusion turns out to be misleading, as we will see later. 

The bad asymptotic performance of the lattice scheme is due to the rate loss term ([3]). As pointed out 
earlier, to make the second term in ([3]) small, the coefficients should be as close to coUinear to the 
channel gains hm as possible. However, since is forced to be an integer vector, and since hm is a 
real vector, this is in general only possible by increasing the norm of a^. This, in turn, increases the 
first term in (|3]). The tradeoff between the two terms in ([3]) is a main theme in the field of Diophantine 
approximation. In particular, the proof of Theorem [T] builds on a result of Khinchin to show that, for 
almost every channel gain hm, the coefficient vector can only be close to coUinear to hm if ||am|| is 
large. 

Example 1. Consider the channel vector h = (1 /i2) to one of the receiver. Consider 

max Ri(h,P,a), 

aeZ2\{0} 

the maximal rate at which any (nontrivial) integer linear equation can be decoded at the receiver. From 
©, we know that 

limsup max — j ; — - — < 1/2 

P^oo aeZ2\{0} ilog(P) 

for almost ever)0 /i2 G IR. On the other hand, for /i2 G Q, 

RL{h,P,a) ^ 



lim max 



P^ooaeZ2\{0} |log(P) 

While these statements are only valid asymptotically as P — ?> oo, this qualitative behavior is already 
visible at moderate values of SNR, as is depicted in Fig. [2l 

We now introduce a different implementation of the compute-and-forward approach that achieves K 
degrees of freedom, matching the asymptotic behavior of the MIMO upper bound ([8]). In other words, 
even though both the receivers and the transmitters are distributed, the proposed communication scheme 
achieves the same number of degrees of freedom as a centralized communication scheme in which all 
transmitters can cooperate and all receivers can cooperate. 

Theorem 2. For every K > 2 and almost every H G R^^^ there exist positive constants C2 = C2{K, H) 
and C3 = c^{K, H) such that for all P > 2 

y log(P) - 02 log^ (P) < C{H, P)<Y log(^) + 



In particular, this implies that for any K >2 and almost every G R 

P^oo |log(P) 



KxK 



Recall that the implementation of compute-and-forward in UU uses lattice/linear codes together with 
output scaling. The aim of this output scaling is to make the scaled channel gains close to integer. The 



''while ^ is stated for almost every h £ R^, the same arguments can be used to show that ^ also holds for almost every h of the form 

(1 h2). 
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Fig. 2. Normalized rate maxa Ri{h, P, a)/| Iog(l + Ai^P) achievable witli lattice codes ITTI with optimized coefficient vector a G Z^\{0} 
for channel gain h — {1 h2) as a function of h2 G [0, 1]. The plots are for a value P of 20dB, 30dB, 40dB, and 50dB (from top to bottom). 
As P — ^ oo, the normalized rate converges to at most 1/2 for almost every value of /12. On the other hand, for /i2 G Q (a set of measure 
zero), the normalized rate converges to 1. This limiting behavior can already be observed at the values of SNR shown here. 



difficulty with this approach is that the scaling of the channel outputs amplifies the additive receiver noise. 
In order for the scaled channel gains to be close to integer, the scaling factor should be large. On the 
other hand, in order to have small noise amplification, the scaling factor should be small. These two 
conflicting requirements result in the Diophantine tradeoff mentioned in the introduction. This tradeoff 
can be observed in the tension between the two terms in ([3]) as discussed earlier. 

Our proposed achievable scheme in Theorem [2] also uses linear codes at the transmitters. However, 
it avoids the scaling of the channel outputs and thereby the Diophantine tradeoff. Instead, we use a 
modulation scheme based on signal alignment over the real numbers to convert the real linear combinations 
produced by the channel into integer linear combinations. This step builds on a construction suggested 
recently for the alignment of interference in [|20l , [|23l . which itself is based on prior work on Diophantine 
approximation on manifolds ||24]| . ||25l . The proposed approach is best illustrated with an example. 

2 case. Assume the channel gains are of the form 

A - 
Xi = Wi, 

X2 = W2, 



Example 2. Consider agam the K = 



Set the channel input to be 
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where both are codewords from the same lattice code. The channel output is then 

yi = wi + h2W2 + zi, 

X2 = hiWi + W2 + Z2. 

Given that both codewords are from the same lattice code, one might hope that an integer combination of 
them might be decodable at higher rates than the individual messages themselves. However, the arguments 
in Theorem [T] show that, for almost all H and at high enough SNR, each receiver can essentially decode 
both wi and W2 whenever it can decode an integer combination of them. This limits the computation rate 
to one degree of freedom. 

A simple improvement over this scheme is to set 

Xi = Wi, 
X2 = hiW2, 

The channel output is now 

yi = Wi-\- hih2W2 + 2i, 

X2 = hi{wi + W2) + Z2. 

This results in the signals wi and W2 to be both observed with the same effective channel gain hi at 
receiver two. In other words, we have signal alignment at the second receiver. However, the signals at the 
first receiver are still unaligned. This limits the computation rate to again only one degree of freedom. 
To achieve alignment at both receivers, split the messages into two parts, and set 

Xl = Wl^l + hih2'Wl,2, 
X2 = hiW2,l + h\h2W2,2- 

This results in the channel outputs 

yi = Wi^i + hih2{Wl,2 + W2,l) + h\h\w2,2 + Zi, 
X2 = + W2,l) + h\h2{Wl,2 + W2,2) + -^2- 

We now have partial alignment at both receivers. Receiver one decodes wia, w^i,2 + w;2,i, and W2,2- Receiver 
two decodes wi^ + W2,i and wi^2 + ^^2,2- It can be shown that this achieves a computation rate of 4/3 
degrees of freedom. 

By breaking the messages into more submessages and aligning them pairwise in the same manner, this 
construction achieves a computation rate approaching two degrees of freedom, as promised in Theorem [21 



Remark I: In the channel model ©, the channel gains H are assumed to be constant and as such 
known everywhere. In practice, this channel state information (CSI) would have to be estimated and 
distributed throughout the network, resulting in signaling overhead. 

Having access to CSI at both the receivers and the transmitters is critical for the operation of the 
compute-and-forward scheme proposed here (achieving K degrees of freedom) as well as for the in- 
terference alignment scheme in ||20l (achieving K/2 degrees of freedom). In contrast, the lattice coding 
implementation of compute-and-forward in IfTTI (and shown here to achieve at most 2 degrees of freedom) 
requires only CSI at the receivers but not at the transmitters. Whether the lattice coding scheme in IfTTI 
achieves the optimal degrees of freedom if the use of transmitter CSI is excluded is an open question. 

Remark 2: Throughout this paper, we have been concerned almost exclusively with degrees of freedom. 
The second-order asymptotics in Theorem [2] are quite poor, especially for larger values of K. Deriving 
tighter approximations valid for moderate values of SNR is an interesting direction for further investigation. 
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IV. Preliminaries: Diophantine Approximation 

In all of the proofs, we will be using facts from Diophantine approximation. Here we provide the 
necessary background as well as some extensions of well-known results. 

Let h he a real and a,q be integers. How well can h be approximated by the ratio a/g? Since the 
rationals Q are dense in the reals R, this can be done to any arbitrary degree of accuracy. However, to get 
a good approximation, the denominator q will, in general, have to be large. The question then becomes 
one of quantifying the tradeoff between the quality of approximation and the size of q. Formally, the 
problem is to analyze the behavior of 

minl/i — a/q\ (10) 

as a function of g e IN for fixed /i e R. A result due to Khinchin (see, e.g., Il26l Theorem 1]) states that 
if is a nonnegative function such that 

oo 

(11) 

<?=! 

converges, then for almost every h eR there exists a positive constant c = c{h) such that 

min|/i — a/q\ > cil){q) 

for all g G IN. On the other hand, if (fTTl) diverges, then for almost every /i G R and every positive constant 
c, there are infinitely many values of g G IN such that 

min|/i — a/q\ < cip{q) 

The convergent and divergent parts of Khinchin's theorem show that for almost every h E R the 
approximation error \h — a/q\ can be made to decay at least as fast as ©(g^^"*""^) but no faster than 
Q{q~^^^) for any 6 > 0. 

The next lemma provides a simple generalization of the convergent part of Khinchin's theorem to more 
than one dimension and to approximations with denominator y/q. 

Lemma 3. Let : ^ R+. If 

oo 
q=l 

then for almost every h E R^ there is a positive constant c = c{K, h) such that 

max minl/ifc — afc/^/g| > cihiq) 

k&{l,...,K}ak& ^ 

for all g G IN. 

The lemma implies that, for almost every h E R^, the approximation error 

max min \hk — ak/ ^/q\ 

k ai^^l^ 

can decay no faster than n(g~^/^~^/^^'^) for any 5 > 0. 
Proof: Let Bq be the vectors h E [0, 1)^ such that 

max min|/;,fc — afc/i/g| < ■?/;(g). (12) 

k afeGZ 

Since hk E [0, 1), the integer can be restricted to the set {0, . . . , [y/g] } for all k. Setting a = (a^), we 
see that a vector h is in Bq if and only if it is at a £oo distance of at most ^/'(g) of such a vector a/^/q. 
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Thus, each a G {0, . . . , [y^]}^ contributes at most a subset of volume {2tp(q))^ to Bg. Since there are 
at most {^/q + 2)^ such vectors a, we have 

By the convergence assumption, this implies that 

oo oo 
9=1 q=l 

Applying the Borel-Cantelli lemma (see, e.g., [|27l Theorem 1.6.1]), this shows that 

IJ,{Bq i.o.) = 0, 

where "i.o." stands for "infinitely often" (as a function of q). Thus, almost every h E [0, 1)^ satisfies (fT2l) 
only finitely many times. Since is the countable union of integer cubes riA^il^fc'^fc + 1)' same 
holds also for almost all h E . 

Fix a h E R^^ for which (fT2l) holds only finitely many times. Then there exists a finite number Q{h) 
such that 

max min|/ifc — afc/-y/g| > ^/'(g) 

k QfcSZ 

for all q>Q{h). Set 

max min \hk — a^/ y/q\ 

c = c(i\, ri) = mm< 1, mm 



' q&{i,...,Q{h)} ilj{q) 



and observe that c is positive. Then 



max min \hk — ak/ ^/q\ > cip{q) 

k a^Gli 

for all g G IN, concluding the proof of the lemma. ■ 
We will also need a generalization of the convergent part of Khinchin's theorem to manifolds in 
Euclidean space. We start with a small example to illustrate the setting. Consider again the question of 
rational approximation in (flOl) . This can be generalized to several dimensions as follows. Fix H E R^^^; 
what is the behavior of 

ini^ \ j2k m(lm,khm,k - a\ 



as a function of qm,k G ^? The generalization of Khinchin's theorem to this setting is referred to as 
Groshev's theorem. 

A further generalization, and the one that will be needed in this paper, is to allow for functions of H. 
Let ^ be a collection of functions g: R^^^ — )■ R. Fix H E R^^^; what is the behavior of 

T^j^\J2geg^99iH) - a\ 
as a function of qg E Z? In particular, we will be interested in the collection of functions 

^L^lflflh'-i:SE{0,...,L- l^xA. (13) 

Lfc=lm=l ) 

In words, Gl is the collection of all monomials in the channel gains H with exponents between and 
L — 1. In the following, we will usually fix a particular realization of H and treat the set Ql as a collection 
of points in R. 

Remark 3: It is straightforward to verify that, for almost every H E R^^^, all L^^ monomials in 
Ql evaluate to distinct numbers. This implies that, for almost every H E R^^^, we have \Qi\ = . 
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Furthemiore, again for almost all H, every g E Gl can be uniquely factorized into powers of /im,fc- In 
other words, to each g corresponds a unique set of powers S such that 

m,k'''m,k ■ 

We refer to this as the unique factorization property. Given that we are only interested in results that hold 
for almost every channel matrix H, we may assume in the following that Ql has this unique factorization 
property. 



The following lemma is a special case of a more general result from [|24|. [|25l (see also [jlQl ). 
Lemma 4. Let ip: ^ R+ be a monotonically decreasing function and L E fi, L > 2. If 



oo 



q=l 

then for almost every H G R^^"^ there is a positive constant c = c{K, H) such that 

for all (g,) G Z^^r-h' \ {0}. 

Lemma m implies that, for almost every H G R^^^, the approximation error 

can decay no faster than 
for any 5 > 0. 

V. Proof of Theorem [H 
We want to upper bound the largest rate 

Ri(H,P)= max RAH,P,A) (14) 



achievable with the lattice coding scheme of pT|. From the above definition, we see that the coefficient 
matrix A can be chosen as a function of P. In particular, to each P corresponds an optimal A = A(P) 
maximizing the right-hand side of (fT4l)FI We consider this A in the following so that 

RaH,P) = RaH,P,A{P)). (15) 

Recall that 

K 

R^{H,P,A) = y2 min R^{hm,P,aJ, (16) 

f — : m;a„ feTiO 
fe=l 

and that Ri{hm, P, dm) consists of two terms, the desired term 

^log(l + P||^„,f) 



and the loss term 



-^log|^||a„f + P(\\hm\\'^\\am\\'^ - [hmoJj'^ 



It can be shown that such an optimal A exists, see 1111 Lemma 1]. If more than one maximizer exists, we choose one of them. 
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We start by upper bounding Ri{hm, P, cim) for a fixed value of ??7 G {1, . . . , K}. Together with (fT6l) . this 
yields an upper bound on Ri{H, P, A) and hence on Ri{H, P). 

We can rewrite the quantity inside the logarithm of the loss term in Ri^hm, P, a.^) as 



P\ \\K 



T\2 



a„f + P||h„f ||a^f (l - cos^(Z(h„, a„))) 
2 + P||/i^f ||a^||2 sin2(Z(/i^, a^)). 



a, 



Now, for X e [-7r/2,7r/2], 



sin (x) > 



-X 



71^ 



and, for Z{hm.,am) measured in G [— 7r,7r], 



> 



hr 



l^mll ll'^n 



by lower bounding the distance along the great circle on the unit sphere by its chordal distance. Since 
R]_{hra, P, o-m) IS invariant to multiplication of by —1, we can assume that Z(h„i, a^) G [—n/2, n/2] 
so that 

2 



P{\\hr„.f\\a„,f 



rP||/i,JI'lla^||2 



h„ 



a, 



\hr. 



\a,r 



> Wttrnf + ^P\\hra\\^\\am\\^ maX 



l<k<K 



h 



— P||^min|amir niax 

TT^ l<k<K-l 



a,- 



m,k 



. (17) 



As we will see shortly, the restriction of the maximum in the last inequality to k G {1, . . . , K — 1} is 
necessary to decouple the (implicit) optimization over into the first K — 1 individual components 
and the magnitude ||am|p. 
Define 

{hm,l ■ ■ ■ hm,K-l) , 



hr 



\hr 



\ar. 



G IN, 



With this, we can rewrite (flTl) as 



max 
fce{i,...,A'-i} 



I ^m.fc ^m,kl \/ Qm I • 



\ar. 



(18) 



We want to minimize the rate loss (iTSl) . The first term in the right-hand side of (1181) is increasing in g^- 
As we shall see, the second term is decreasing in q^- Hence there is a tradeoff between the two terms 
that determines the optimal value of g^- 

The behavior of the approximation error ipmiqm) can be bounded using the convergent part of Khinchin's 
theorem in — 1 dimensions. To this end, note that 

ipm{qm) > min max \hm,k - a,n,k / ^/q^\ 

= max min |/im,fc - am.fc/v^L 
fce{i,...,i^-i}a„,fcez ' .VI 
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which is of the form analyzed in Lemma [3l Applying Lemma [3] in if — 1 dimensions shows then that for 
any fixed 5 > and almost every there exists c = c{K, hm) = c{K, h) > such that 

i^miqm) > cg-V2-i/(K-i)-. (19) 

for all Qm E IN. Observe that this lower bound holds for any choice of a„; in particular, the constant c 
is uniform in a^. We can then continue to lower bound the loss term in Ri{hm, -P, a.m) as 

a +—p\\h W^a ^b"^ ia )>o + —cPWh 11 V^/^^'"^)"^^ 

Hra o II I ^rriYmKHrn J — Hm t r.'^J- |"'m|| Hm 



> maxjg^, ^cP||/i^||2g„2/(x-i)-25|^ ^20) 



This shows the tradeoff between the two cost terms. Recall that we are allowed to choose A = A{P), 
and hence also qm, as a function of power P. Asymptotically, the optimal choice of Qm is 

and hence 

qm + P\\hJ\'-,qm^Uqm) > n{P^'^'/^^-'^^''^-') 

as P — )■ oo. 

Combined with (fTSi) . this shows that, for almost every hm. 



Rdhm, P, aUP)) < I log(l + PWhrnf) - I log(fi(p(^+^/(^-^)+^^)-^)) 



as P — > oo. This implies that 



RLih,P,aUP)) ^ 2+~5 

limsup 7- — - — ^ < (21) 

P^J ^log(P) -K + 1 + 6 



where we have set 

6 = 2{K-1)S>0. 

We have argued that (|2T1) holds for almost every hm E R^^^. It is shown in Appendix |A] that this implies 
that (|2TI) also holds for almost every hm E . 

Up to this point, we have analyzed the rate for a single receiver m. Using the definition of RiiH, P, A) 
in (fT6l ), this yields the upper bound 

K 

RaH,P,A{P)) = y^ min Rt{hm, P, am{P)) 

k=l 

<K max R]_{hm, P, amiP)) 

m&{l,...,K} 

on the sum rate. Together with (|2TI) and using the union bound over mG{l,...,A'}, this implies that 

RaH,P,A{P)) ^ 2+~6 

lim sup ; — < 

P^oo I log(P) 1 + 1/K + 6/K 

for almost every H. As we have assumed that A(P) is the optimal coefficient matrix for power P, this 
implies by (fT5l) that 

Rt{H,P) ^ 2+~5 
lim sup —, — 7757- < 



p^oo |log(P) 1 + 1/K + 5/K 
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for almost every H. Since 5 > is arbitrary, we may take the limit as 5 —t- to obtain 

lim sup —, — < 



P^J ilog(P) - 1 + 1/X' 

yielding the desired upper bound on the degrees of freedom of the lattice scheme. 

We now derive an estimate of the speed of convergence. Observe that we can choose 5 in (fT9l ) as 

, 21oglog(l + g^) 

= o[qm) - 



(i^-l)log(g^) 

and still satisfy the convergence condition in Lemma [3] in K — 1 dimensions. The lower bound (|20l) on 
the loss term in R]_{hm, P, Om) then becomes 



Qm + -^P\\hm\\^qm'ipl,{qm) > max<^ qm, -^cP\\hmfqm ~^ log ^-1(1 + g„) > 
TT^ [ vr^ J 

> log ^-1 (1 + qm) raaxl —cP\\hmfqm^~^ >. 

By [fm Lemma 1], we can restrict the optimization over A to matrices satisfying 

a = \\a < \\h II^P 

so that 

qm + :^P\\hmfqmijl,{qm) > log ^'-1(1 + \\hmfP) maxl qm, ^cP\\hmfqm^~^ y 

We can now solve for the optimal qm- Proceeding as before, we obtain an upper bound on the computation 
rate with lattice coding of 

Ri{H,P)< max K I l\og{l + P\\hmf) - ^lognflog-^^i {1 + \\hmfP)P^'^^^ ]] 

me{l,...,K} \ Z z \ J j 

< Y^^log(P) + 0(loglog(P)) 

as P — !■ 00. This implies that for any K > 2 and almost every H G R^^^ there exists a positive constant 
ci = Ci{K, H) such that for all P > 3 



Ry_{H, P) < Y^Y/K + loglog(P), 



proving the theorem. 



VL Proof of Theorem [2] 

The upper bound in Theorem |2] follows immediately from dV]). We focus here on the lower bound 
showing that 

liminf >K. 

P^oo llog(P) - 

We start with a high-level description of the scheme achieving this performance in Section IVLAI The 
detailed analysis can be found in Section IVLBi 
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A. Description of Communication Scheme 

The proposed coding scheme consists of two components: a modulation scheme and an outer code 
(see Fig. [3]). The encoder fk for the outer code at transmitter k maps the message Wk into the sequence 
of coded symbols {wk\t\)'^^i. The modulator fk at transmitter k maps each coded symbol Wk\t\ into a 
channel symbol Xk\t\. Thus, while the outer code produces a block of coded symbols, the modulation 
scheme operates on a single coded symbol to produce a single channel symbol. The encoder in the 
definition of computation capacity is the concatenation of these two encoding operations. At receiver m, 
the demodulator 0^ computes Um[f\ from the channel output ym[t], and the decoder 0^ for the outer code 
maps the sequence {uraf\)t=i into an estimate of the desired function Um- Both and Um are defined 
as a function of {wk) and {wk), respectively. The decoder in the definition of computation capacity is the 
concatenation of these two decoding operations. 




Fig. 3. Modulation scheme ({/fe},{<^m}) together with outer code ({/fe}, {<^m})- At transmitter fc, the message Wk is mapped by the 
encoder fk of the outer code into the sequence {wk\t\)t=i of modulator inputs. Each ■Wk\t\ is mapped by the modulator fk to a channel 
input Xk[i\. At receiver m, the channel output ym[t\ is mapped by the demodulator into a demodulated equation S,„[f]- The sequence 
of these demodulated equations is mapped by the decoder of the outer code to an estimate of the desired equation Um- 

Note that in the description of the proposed achievable scheme we are using the following notational 
conventions. Quantities related to the outer code are denoted by standard font, i.e., fk, (pm, Wk, ■ ■ ■ The cor- 
responding quantities related to the modulation scheme are indicated by bars, i.e., fk, 4>m, Wk, ■ ■ ■ Estimated 
quantities are indicated by hats, i.e., the output Um of the decoder of the outer code is an estimate of the 
correct output Um, and similarly the output Um of the demodulator is an estimate of the correct output 

The construction is as follows. Each message Wk is split into \Ql\ = submessages {wk,g)g^gj^ vvith 
Ql defined in (fT3l) . Every fk encodes each of these submessages Wk,g using the same linear code. Thus, 
all encoders [fk] are identical. The modulator fk combines these \Ql\ codewords into a single sequence 
of channel inputs. 

Consider now receiver m. The channel to this receiver is in effect a K-mtx multiple-access channel 
(MAC). By splitting the transmitted message into submessage, we have transformed this A'-user MAC 
into a A' I I -user MAC, with each user corresponding to one submessage. The demodulator 0^ splits this 
MAC at receiver m into subchannels. Through careful design of the modulators, this splitting can 

be done such that each of the resulting |^l+i| MACs outputs a (noisy) sum of only K out of the K\Ql\ 
possible input signals. Observe that this channel is linear with integer channel coefficients. Hence, the 
linear codes used as outer code can now be efficiently decoded. The decoder of the outer code is thus 
chosen to recover the submessage corresponding to the sum of the K codewords seen over this MAC. 
Decoding is shown to be possible with vanishing probability of error with a rate of order 2\g\+-i^\ los(-P) 
each of the submessages for large P. Moreover, it can be shown that the resulting collection of decoded 
functions is invertible. 

Since there are submessages for each of the K transmitters, the sum rate achieved by this scheme 
is on the order of P\Q \ K 

The scheme achieves therefore 

K 



(1 + l/LY' 

degrees of freedom. For large L, this is approaches the K degrees of freedom claimed in Theorem |2l 
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B. Detailed Proof of Achievability 

The proof of the theorem consists of three steps. First, we show how the modulation scheme transforms 
the noisy linear combinations with real coefficients produced by the channel ([T]) into a system computing 
noisy linear combinations with integer coefficients. Second, we show how the outer code further transforms 
this modulated channel into a system computing noiseless linear combinations with integer coefficients. 
Third, we argue that the linear combinations produced by the outer code are invertible, i.e., the messages 
at the transmitters can be recovered from the computed linear combinations of all receivers. 

We now describe the operations of the modulation scheme in detail (see Fig. H]). Recall the definition 
oi Qi'm (fT3] ) as the collection of all monomials in the channel gains with exponents between and L — 1. 
The input symbol Wk[t] to the modulator at transmitter k at time t consists of \Qi\ subsymbols 

Wk[t] = {wk,g[t])gegL^ G {0,...,p- 1} VA;,5(,t 

for some p, L G IN to be chosen later. The output symbol Um[t] of the demodulator 0^ at receiver m at 
time t consists of subsymbols 

Um[t] = iUra,g[t])gegL+i^ Um,g[t] G {0,...,p- 1} \/m,g,t. 

Note that the number of input and output subsymbols per time slot are not the same. 

("'2,9)966^ irQD^^2 



Fig. 4. Modulation scheme {{fk}, {<t>m})- The modulator fk at transmitter k takes {wk,g)geQL input. The demodulator (j)m at receiver 
m produces {um,g)geQL+i output. Indicated by the dashed box is the modulated channel obtained by viewing the modulation scheme 

as part of the channel. This modulated channel is discrete and memoryless. All operations take place over a single time slot t; the dependence 
of Wfe,g, ifc, j/m, Um,g ou t is Omitted in the figure. 

The modulator at transmitter A; is a linear map, producing the channel input 

Xk[t] = fk{{wk,g[t])g^g^) Wk,g[t]g (22) 

g&GL 

with 

B = B{L,p) = (A»l^^+il. (23) 

For g G Gl+i, define 

K 

Um,g [t] = J2 ^Ua/h^.,) M , (24) 

fc=l 

where we use the convention that Wk,{g/h,-,-,k)\^] = whenever g/hm^k ^ Gl- 

The definition of ^^,n,g[^] can be interpreted in the following way. Let g G Gl, and consider the term 
Wk,g[t]g in the definition of Xk[t]. At receiver m, this term is observed as Wk,g[t]ghm,k- Thus, for any 
g G Gl+1, Um,g[t] is the sum of all input subsymbols {wk,g[t])g^g^ that are observed with coefficient g at 
receiver m. Another way to see this is as follows. The signal observed at receiver m is 

K 

ym = B^^^ Wk,g[t]hm,kg + Zm- 
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Now, note that hm,k9 is a monomial in the channel gains with highest exponent at most L. Hence hm,k9 £ 
Gl+1 for all m, k. Using the definition of Um,g, we can rewrite the received signal as 

K 

k=l geQL+i 
= 5 ^ Um,g[t\g + Zm- 

This last equation is a key step in the construction of the achievable scheme. It shows that the received 
signal can be decomposed into terms Um,g, each multiplied by a different effective channel gain g. 

Crucially, each of these terms is an integer linear combination of up to K input signals Wk,{g/h^ k)^ 
from each transmitter. 

The demodulator (j)„t at receiver m is the maximum likelihood detector of {um,g[t])g^g^^-^, i.e., 
4>m{ym[t]) = arg max P(nggg^^^ [t] = | ym.[t]), 

{Um.g) 

where the arg max is over all possible values of {um,g)g<^gi^+i- Denote by 

{Uni,g[t])g(igL+-, = (pm{ym[t]) 

the output of the demodulator. The probability of demodulation error at receiver m is then defined as 
Observe that the goal of the demodulator is to recover |^l+i| integers {um,g[t])g from a single observation 

Vrnit]. 

The next lemma describes the performance of this modulation scheme. 

Lemma 5. For any K >2 and almost every H G R^^^ there exist positive constants C4 = C4{K,H) 
and C5 = c^{K, H) such that, for all p, L, t G IN and /c G {1, . . . , K}, the input signal to the channel has 
power at most 

= P{L,p) (25) 

and 

implying that the probability of demodulation error is at most 

P(U3eg,+,{wn^,,[t] ^ u^,g[t]}) < F{{\z^[t]\ > c,p'/^}) 

< exp(-|c5p) 

= e{p). (26) 

The proof of Lemma \5\ is presented in Section IVLCI 

Lemma [5] bounds the power of the channel input [t] and, more importantly, states that the probability 
of demodulating in error decreases exponentially in p. Thus, when p is large enough, the probability of 
demodulation error is small. 

The original channel between transmitters and receivers produces noisy real linear combinations of 
the channel inputs. After applying the modulation scheme, we have transformed this into a channel that 
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produces noisy integer linear combinations of the channel inputs. More precisely, using the definition of 
Um,g in (l24l) . we can write this new channel as 

K 

= ^^.{f M + ^m,g [t] , (27) 

k=l 

where we have defined the modulation noise Zm,g[t] as 

Zm,g[t] = Um,g[t] - Um,g[t]. (28) 

Thus, we see that the channel resulting between the input of the modulator and the output of the 
demodulator computes noisy linear combinations with integer (indeed, either zero or one) coefficients. 

We refer to this new channel after modulation as the modulated channel. Since the modulation scheme 
operates on a single time slot t, this modulated channel is discrete and memoryless. Note that the noise 
Zjn,g of this modulated channel is not necessarily additive, i.e., Zm,g[t] is not necessarily independent of the 
channel input Um^g[t]. However, by Lemma [5l we know that the noise is small. This modulated channel 
is depicted in Fig. lU 

As we have argued before, the probability of demodulation error e{p) goes to zero as p — t- oo and hence, 
by (l25l) . as power P ^ oo. However, the definition of computation capacity requires that the probability 
of error be arbitrarily small for fixed power P. The next step is therefore to transform the modulated 
channel into a system producing noiseless integer linear combinations of the channel inputs. To this end, 
we employ an outer code over the modulated channel. We call the encoder and decoder of this channel 
code fk and for transmitter k and receiver m, respectively. It will be convenient to choose p to be 
a prime number. Reducing the modulator outputs modulo p, we can then interpret the input and output 
subsymbols Wk,g[t] and Um,g[t] as well as the integer linear combinations performed by the channel as 
being in the finite field Fp. We refer to the resulting channel over Fp as the modulated Fp channel. 

We now describe the operations of the outer code in more detail (see Fig. [5]). The channel encoder 
at transmitter k consists of \Ql\ sub-encoders 

fk — {fk,g}gGgL- 

The channel decoder gm at receiver m consists of sub-decoders 

<Pm — {'Pm,g}g£gL+i- 

(™3,9)9eaL 




Fig. 5. Outer code ({/*;}, {(/'m}) over the modulated Fp channel. Each encoder fk uses the same linear map given by the matrix 
S G jpj^"^^/ 0^ jjjg corresponding minimum distance decoder for the equations Um- 

Consider the message Wk,g at sub-encoder g of transmitter k. It will be convenient in the following to 
express this message as a vector in ]p™/'°s(p) Whenever this vector structure is relevant, we will write 
the message as Wk^g. The encoder ^ maps Wk^g to the vector (or, equivalently, sequence) of modulator 
inputs 

= i^k,g[t])ll e Fj. 

The rate of this channel encoder is hence 

log(p'r^/i°s(p)) 

7^ = R 

T 
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bits per use of the modulated channel. The encoder is specified by the linear map 

'l"k,g = fk,g{'Wk,g) = SWk,g, (29) 

for some matrix S G ]pj^^^/'°s(p)^ ^nd where the multiplication is understood to be over Fp. We point 
out that S does not depend on k and g. In other words, each encoder /^ ^ uses the same linear map. 
Define the vector version of the demodulator output Um,g as 



Similar to the definition of Um^g[t] in (|24l) . set 

K 

'^m,g = ^Wk,(g/h^ ,) (mod p), (30) 

k=l 

where we again use the convention that w.m,{g/h^ fe) = whenever g/hm.^k ^ Gl- 

Recall that the modulated channel computes noisy linear combinations over the finite field Fp. Since 
all channel encoders use the same linear code, this implies that the output of the subchannel g at receiver 
m is equal to Sum,g plus small noise ^ resulting from erroneous demodulation as defined in (l28l) . As 
pointed out earlier, the noise term Zm,g may not be additive, i.e., Zm^g may be dependent on the channel 
inputs. Formally, the (vector) of demodulated equations ii^.g is equal to 

K 

'^rn,g = ^ ^k^g/h^^k) + Zm,g 
k=l 
K 

= ^ SWk,(g/h^^k) + ^m,g 
k=l 
K 

= 5 ^ Wk^g/h^^k) + ^m,g 
k=l 

(c) 

= SUm,g + Zm,g (modp), (31) 

where (a) follows (|27|) . (b) follows from the definition of the encoder ^ in (|29l ). and (c) follows from 
the definition of the equation Um,g in (|301 ). Thus, since all transmitters use the same linear code, the 
encoding operation commutes with the operation of the channel. Note that (l24l) and (|3T| ) imply that 

'^m,g SUm,g- (32) 

The decoder 0^,^ of the outer code is the minimum (Hamming) distance decoder, i.e.. 



T 

^m,g\^m,g)= aig min ^l[u^^g[t]^ {SUm.^g)[t]), 



U,n,,6F™/'°-('') t=l 



where {Sum^g)[t\ is component t of the vector Sum,g- Note that this decoder might not be the same as 
the maximum likelihood decoder, depending on the distribution of z^^g. Denote by 



1J'm,g (Prn,g\Um,g j 



the output of the decoder of the outer code. The probability of error of this code is defined as 

^ {}^m,g{^m,g 7^ ^m,g})- 

For X E (0, 1), define the p-ary entropy function 'Hp{x) as 

T-l-p{x) - r—TT log(P -I) -X log(x) - (1 - x) log(l - x)) . (33) 
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The next lemma states that for the modulated Fp channel there exist linear codes S with large rate that 
allow reliable decoding at each receiver. 

Lemma 6. Denote by e{p) the upper bound on the probability of demodulation error as defined in (|26|) 
in Lemma \5\ For every prime number p such that e{p) < 1/4, and every 77 G (0, 1/2 — 2e{p)) there exists 
a linear code S (of sufficiently large blocklength T ) for the modulated Fp channel with rate bigger than 

(l-Hp(2£(p)+r/))log(p) 

and probability of error less than 77. 

The proof of Lemma [6] is presented in Section IVLDi 

Lemma |6] shows that, asymptotically in the blocklength T, reliable communication over the modulated 
subchannels is possible at rates arbitrarily close to 

{i-npi2E{p)))\og{p), 

with probability of demodulation error e{p) as defined in Lemma [51 Since there are K transmitters each 
with 1^2,1 subchannels, the sum rate achieved with this coding scheme is at least 

K\gL\{l-npi2eip)))\ogip). 

Note that 

lim np{2e{p)) = 

so that the sum rate is of order 

K\gL\{i-oii))\og{p) 

as p — 00. 

To satisfy the definition of computation capacity, we need to argue that the mapping from (wk^g) to 
{Um,g) defined in (l30l) is deterministic and invertible over its range. As the channel gains H are constant 
and known, the mapping is clearly deterministic. The next lemma shows that the mapping is also injective 
(and hence invertible over its range). 

Lemma 7. Let p be a prime number. For any K > 2 and almost every II G R,^^^, the mapping from 

{wk^g : A; G {1, . . .,K},g G Gl) 

to 

{um,g : m G {1, . . . , K], g G 

is injective over Fp. 

The proof of Lemma |7] is presented in Section IVLEI 

Together with Lemmas \5\ and [6l Lemma U\ shows that for every prime number p, a computation rate 

C{H,P{L,p)) > KIGlKi - 'Hp{2e{p)))\og{p) (34) 

is achievable, with 

P(L,p) = cf(ii'p)2|^^+^lL2^V 

as defined in (l25l) . 

Fix a power P, and let p and p be two consecutive prime numbers such that 

P{L,p)<P<P{L,p). 

By Bertrand's postulate (see, for example, |l28l Theorem 5.7.1]), any two consecutive primes p and p 
satisfy, 

p < p < 2p. 
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Since P{L,p) is increasing in p, this implies that 

P{L,p)<P<P{L,2p). (35) 



Combining (1341) and (1351) shows that, for every power P and corresponding prime number p chosen as 
above, 

where 



D{p) 



ilog(P) - ilog(P(L,2p)) 

K\gL\{l-np{2e{p)))log{p) 



K\Ql\ 




\'3l+i 


1 + 1 



iLlog(c4) + |6;L+i|log(2A» + A'2 1og(L) + log(2p)- 
Since p — oo as P — j- oo (with K and L fixed), this implies that 

CiH P) 

P-s>oo i log(P) p-s>oo 

where the limit p — oo is understood as being taken over the prime numbers. By Remark |3] in Section HVl 
we have 

for almost every H. Hence, this shows that 



ilog(P) - {L + l)^' + l 
As this is true for all values of L, we may take the limit L — oo to obtain 

lim inf -r^ — ] — }- > lim —r-^ = K. 

P->oo 1 log(P) - L^oc (L + 1)^' + 1 

Hence, the proposed implementation of compute-and-forward achieves K degrees of freedom, as needed 
to be shown. 

We now derive an estimate of the rate of convergence to this limiting value. Fix a power P. Sej^ 

1 



L(P) ^ logi+/^^ (P), 

and let p{P) be the largest prime number p such that P(L(P),p) < P. Using Bertrand's postulate as 
before, we obtain that 

P{L{P),p{P)) <P< P{L{P),2p{P)). (36) 
Solving P(L,p) in (l25l) for p yields 



p = 

Together with (|36l ). this implies that 



log(p(P)) = .nr""^^^]^,, - 0(1), (37) 

^(|t^L(P)+ll + 1) 



where we have used that |^l+i| = [L + 1)^^. 

*The number L{P) might not be an integer. We ignore the rounding error since it is immaterial as P — j- co. 
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From (1341) and OH) 

C{H,P)>C{H,PiL{P),piP))) 

= K\gL\{l-n,{2e{p)))\ogip), (38) 

where we have suppressed dependence of p and L on P. By the definition of the p-ary entropy function 
'Hp(x) in (l33l) . we have for any a; G (0, 1) 

'Hp(x) < X + 'H2(a;)/log(p). 

Therefore, (l38l) can be further lower bounded as 

C(/f,P) > K\gL\{il-2e{p))\og{p)-n2{2e{p))). 



Substituting (l37l) . 

From Lemma 151 
Moreover, 



C(/f , ^) > f (1 - 2e(p)) ,^ '^r, , log(P) - I^Lie(l). (39) 











+ 1 



1 - 2e{p{P)) >l-0 (\og~ (p) j . 



K 



2 



i^L(P)+ii + i vv L(p); {L{p)Y' 

> 1 -0(^log~i+^(P) 



and 



|e^L(P)l=logl+^^(P). 



Combining this with (1391 ) yields 

K/ „/ _ „/. 



C{H,P) > y(^l -0(^log"i+^^(P)))log(P) -0(^logi+^^'^(P) 
> ylog(P)-o(logTTF^(P)). 



Thus, for every > 2 and almost every H E R^^^ there exists a positive constant C2 = C2(A', J?) such 
that for all P > 2 

C{H,P)>^ log(P) - C2 logT^ (P) , 
completing the proof of the theorem. ■ 



C. Proof of Lemma \5\ 

Since modulation involves only a mapping of one symbol at a time, we can drop the time indices in 
the following discussion (e.g., we write Xk for x^ft]). 

We start by analyzing the power of the transmitted signal Xk- Each iVk,g takes value in {0, . . . ,p — 1} 
and hence has power wl ^ < p^. Moreover, |^l| = , and each g eQl satisfies 

1^1 < (max{l,max^ jr|/i^^^|})^^ . 
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2^2 



Thus, the channel input Xk as defined in (l22l) has power at most 

< B^L^^^p'^ (max{ 1, max^^^ | h^-^ 

Defining the positive constant 

C4 = Ci{K,H) = (max{l,max^^^|/i^^^|}) 
and using the definition of B in (l23l) . we obtain 

as required. 

We continue by analyzing the probability of demodulation error. Recall that the received signal 

K 
k=l 

K 

= ^ ^ Wk,ghm,kg + Zm 

can be rewritten as 

ym = B ^ Urn,gg + Zm, (40) 

with Urn,g as defined in (|24|) . Receiver m aims to demodulate the functions {urn,g)gegL+i from ym- We 
now argue that this is possible with small probability of error. 

Consider a different set of linear combinations ("m,g)9egL+i i^rn,g)gegL+i^ compute the difference 

l^geGL+^^^'d - ^'mj9\ = lEgGSi+i^S^I- (41) 

Note that Um,g G {0, . . . , K{p — 1)} and hence Qg G {—K{p — 1), ... , K{p — 1)}. Moreover, observe that 
Um.,g = u'^^g = for (yf = 1 in any valid set of equations and almost every H, so that gi = and can be 
ignored. By Lemma HI for almost every H there is a finite constant c = c{K, H) such that 

\EgeG.,,,g^l%9\>c{KpP'^^^^^y' 

for all {qg)g^, G {-K{p - 1), . . . , - (g,),^i ^ 0. 

Combined with (|40l ) and (|4TI) . this shows that the minimum distance between any two signal points at 
receiver m is at least 

cS(i^p)-l^^+il+i/l 



Using the definition of B in (1231) . we see that the minimum distance between the desired set of equa- 
tions {um,g)g and any other set of equations {u'^g)g is at least c(i^p)^/^. Therefore, the probability of 
demodulation error is upper bounded by 

^{Uy{Um,g ^ Um,g}) < ^ {\Zm\ > Cr^P^'"^) 

with 

C5 = C5{K, H) = ^ . 

This, in turn, can be upper bounded using the Chernoff bound as 

P(|zi| >C5P^/2) <exp(-ic^p), 
concluding the proof of the lemma. ■ 
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D. Proof of Lemma |6| 



We derive a lower bound on the rate achievable with linear codes over the modulated Fp channel. 



Recall that each channel encoder uses the same linear map S. By (I3TI) . the output of the sub-demodulator 
g at receiver m reduced modulo p is 

Um,g = SUm,g + Z„^,g (mod p) , 

with (non-additive) noise ^ satisfying 

Hz^,g[t] ^ 0) < l{\zm[t]\ > c^p"'^) (42) 

for all g G Ql+i, and 

nW^[t]\>c^p"^)<e{p)=e (43) 

by Lemma m Thus, we only need to analyze the performance of linear codes over a point-to-point channel 
with input and output alphabets Fp and (non- additive) noise ^m,g[t]. 

By the Gilbert- Varshamov bound (see, e.g., [|29l Theorem 12.3.2, Theorem 12.3.4]), for every T, prime 
number p, and 2 < d < T /2, there exists a linear block code of length T over Fp with rate at least 

(l-?/p((rf-l)/T))log(p), 

and minimum distance at least d. Recall that < e < 1/4 by assumption, and fix 77 G (0, 1/2 — 2e). 
Choose 

d = l{2e + 7])T\ > {2e + r])T - 1, 
so that the linear code can correct up to 

[{d - 1)/2J > [{e + v/2)T - IJ > (e + r//2)T - 2 

errors. 

Since we use minimum-distance decoding at the receivers, this implies that we make an error only if 
the noise has Hamming weight larger than (e + ri/2)T — 2, i.e., if 



m^Y.l{zm,g[t] ^ ^) > {e + Til2)T-2. 
^ t=i 

Using (l42l) . we have 

P Lax 1 (^-,9 [t] 7^ 0) > (e + r,/2)T - 2^) < P 1 (| z„ [t] | > c^p"^) > {e + r,/2)T - 2 
^ ^ t=i ' ^t=i 

Since (zm[^])t is i.i.d., the weak law of large numbers shows together with (|43l) thaiZl 

lim P ("max ^ ^ 0) > (e + r//2)T - 2^ = 0. 

In particular, for T large enough this probability is less than rijK, so that with probability at least \ — r] 
all decoders are able to decode correctly. The rate of this code is at least 

(1 - Hp((d - 1)/T)) log(p) > (1 - Hp(2£ + r/)) log(p), 

where we have used that 'Hp(x) is increasing in x for x < 1/2. This proves the lemma. ■ 

^We point out that the law of large numbers does not apply to 2„i,s [i], since this sequence is dependent on the channel input and therefore, 
without further assumptions on those inputs, not i.i.d. 
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E. Proof of Lemma [7| 

We need to show that the map from the input to the encoder of the outer code 

to the correct output of the decoder of the outer code 

[u^^g : m G {1, . . . , K}, g G 

is injective. The mapping from Wk,g to Wk^g defined in (|29l ) and the mapping from Um.^g to Um,g given by 
(|32l ) are both invertible over their range. Hence, it suffices to prove injectivity of the map from the input 
to the modulators 

{wk,g : A; G G Gl) 

to the correct output of the demodulators 

{um,g : m G {1, . . . , K], g G 

over Fp. Finally, since this map is defined at the symbol level, it suffices to prove injectivity for a single 
time slot t. To simplify notation, we drop the index t in the following, i.e., we write Wk,g and Um,g for 
Wk,g[t] and Um,g[t]- Thus, we need to show that the map from 

{wk,g : k e {1,.. .,K},g G Gl) 

to 

{um,g : m G {1, . . . , K}, g G Gl+i) 
is invertible over its range. This map is defined in (|24|) as 

K 

Um,g[t] = ^Wk,ig/h^,,)[t\ (mod p) , 
k=l 

with the convention that Wk^g = whenever Wk^g ^ Gl, and taking into account that the modulator output 
is reduced modulo p. 

In the remainder of the proof, we make repeated use of the fact that, for almost every channel realization 
H, every monomial (evaluated at H) in Gl+i can be uniquely factorized into powers of {hm,k), see 
Remark |3] in Section |IVl We refer to this as the unique factorization property in what follows. For 
g G Gl+1, we use the notation /i^^ | (7 to denote that /i^^, is a factor of g in this unique factorization. 

We begin with a small example with L = K = 2. Recall that the channel gain between transmitter k 
and receiver m is denoted by hm^k- By definition, 

G2 = {K'l hl]i h'^ i^ i : si,2, S2,i, 82,2 G {0, 1}} 

and 1^2! = 16 for almost every H by unique factorization. 
Consider the received monomial ^ G ^3. If 

hli I g 

at receiver 1, then g can have originated only from transmitter 1 by unique factorization. In other words, 
ui,g = yJi,{g/hi 1) and hence wi,(g/hi 1) can be recovered from the received signal. If 

^2,1 I 9 

at receiver 2 than again g can have originated only from transmitter 1 by unique factorization. Hence 
U2,g = ^i,{5//i2i) ^i,(3//i2i) be recovered. We can then remove these messages from all other 
equations at the receivers. 
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A similar conclusion can be drawn if /i^ 2 ^ factor at receiver 1 or respectively /ig 2 ^ factor at receiver 
2. In either case the monomial originated at transmitter 2, and the corresponding message W2,g can be 
recovered again by unique factorization and the terms again be removed from all received signals. 

Other parts of the signals may also be identified as only originating from transmitter 1. For example 
^i,i/i2,i at receiver 1 cannot be seen as a message from transmitter 2 because hi^2 is not a factor. All 
such messages can be decoded and removed as was done with the previous messages, again by unique 
factorization. 

This leaves only the message wi^g with g = /ii,2^2,2 from transmitter 1 and the message W2,g with 
g = /ii,i/i2,i from transmitter 2 to be determined. But /ii,2^2,2 from transmitter 1 is observed at receiver 
1 as 

g = ghi,i = /ii, 1/^1,2/^2,2 
which, to have originated from transmitter 2, would be transmitted as 

g/hi^2 = /ii,i/i2,2- 

However, this message was already removed in the first round (since it is observed at receiver 2 as 
/ii 1/122) and so the remaining signal at receiver 1 must have originated from transmitter 1. Thus wi^g can 
be determined. The same can be done for g = /ii, 1/^2,1 from transmitter 2, and the message W2,g can be 
obtained. This completes the example as all messages Wk,g for g G G2, k E {1,2} have been determined. 

We now extend the above argument to A' = 2 but L arbitrary, proceeding by induction. We will argue 
that the messages Wk,g,k = {1,2}, g G Ql are completely determined by Um^g where m G {1,2} and g 
ranges over the possible received monomials in Gl+i- The proof is by induction on L, and our earlier 
argument for L = 2 anchors the induction. 

Suppose the induction hypothesis holds for L — 1 > 2. We now show it holds for L as well. As before, 
determine and remove from the received signals all messages Wi^g, such that h^^^ \ g or h2i^ \ g at 
transmitter 1 (using unique factorization). Do the same, but for messages W2,g such that h^2^ I 9 or 
^22^ I 9 transmitter 2. 

Now, consider g at transmitter 1 such that /if 2^ I 9- This will be received as 

9 = 9^2,1 

at receiver 2, and either /i2,2 I fi' or it is seen as a monomial component originating from transmitter 1 
immediately by unique factorization. To be from transmitter 2, the transmit monomial would be 

9/h2,2- 

But then h^2^ \ (^7/^2,2) and therefore the message corresponding to this signal has already been removed 
from both receivers. 

The same is true for all messages wi^g with g such that /i2 2^ I 9 transmitter 1. Moreover the same 
arguments apply to transmitter 2 but with /if ^21^- follows that all factors Wk^g with the highest 

exponent in g being L — 1 have been determined for both transmitters. The remaining monomials make 
up at both transmitters. Since the factors involving monomials with highest exponent L — 1 have 
been removed, we may apply the induction hypothesis to complete the inversion. Thus, for K = 2 and 
arbitrary L > 2, the mapping can be inverted. 

It remains to consider K > 2 and L > 2. We will argue that the factors Wk,g, /c G {1, 2, ■ ■ ■ , K}, g E Gl 
are completely determined by Um,g where m = {1,2, ■■ ■ , K} and g ranges over the possible received 
monomials in Gl+i- As earlier, we proceed by induction, but this time on K. The result holds for K = 2 
as we have already demonstrated. 

Suppose then the result holds for K — 1 > 2, and consider the case with K transmitters and receivers. 
Fix L > 2 arbitrarily. For each transmitter k, we can once again remove all the factors Wk,g whenever 

K,k I 9 
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for some m. 

Now let us fix a receiver, say m = 1, and a transmitter, say k = 2. Note this choice is entirely arbitrary 
as transmitters are in no sense tied to receivers so we may re-index them to obtain this case. Consider a 
monomial g at transmitter k such that 

At receiver 2, this is observed as 

9 = 9h2,k- 

For such a monomial to be seen at receiver 2 as originating from transmitter 2, we must have that 

^^2,2 I 9, 

since otherwise we can rule out transmitter 2 at receiver 2 by unique factorization. However, if this is the 
case, we see that the corresponding message has already been removed for transmitter 2 as 

ht2' I i9/h2,2). 

Thus under either outcome this monomial component cannot be seen as originating from transmitter 2. 

Proceed as follows. Consider receivers m G {2, 3, ■ ■ ■ , K} (leaving out receiver m = 1) and collect all 
equations such that m ^ 1 and 

ht2' I ~9 

using unique factorization. We may consider these as originating only from transmitters k E {1,3, ■ ■ ■ , K} 
(leaving out transmitter ^ = 2) by the argument in the preceding paragraph. We thus now have the problem 
of identifying Wk,g such that /if 2 ^ I 5'; ^ 7^ 1' using the received signals at 7n ^ 1. Denote the corresponding 
set of transmit monomials by 

Q'^ = {9^QL:h\f\g}. 
Observe that any power of the channel gains /im 2, 7^ 1 and hi k, k ^ 2 may be a factor of the monomials 

m . 

1 2 

To proceed further, note that the monomials in Qj^ can be partitioned into equivalence classes such 
that each g in the same class has the same factors 

^fe'n^i:fc'nc2^ (44) 

fc^2 m^l 

in their unique factorization for some fixed < ^ < L — 1,0 < 3^,2 < L — 1. We call (l44l) the 
(l,2)-factor of g. We may also partition the receive monomials according to their (l,2)-factor. Denote by 

^L^((Sl,fc)fc^2, (Sm,2)m^l) C Q]^'^ 

the equivalence class with (l,2)-factor 

^fe^n^s'nc2^- 

Fix a class (si^fc)fc^2, (sm,2)m7^i, and consider the messages 

(wk,g ■■k^2,g e Q]f'{{Sl,k)k^2, {Sm,2)m^l)^ (45) 

and the equations 

{um,g ■m^l,g e G^li ((Sl,fc)fc^2, (Sm,2)m^l)) • (46) 



Recall that we have removed all messages W2,g from the equations (1461) . Observe that any equation Um,g 
in (|46|) is then solely a function of the messages in (l45l) . 
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Divide out the common (l,2)-factor from the transmit and receive monomials in (|45l) and ( |46l ). This 
results in a set of messages and equations with monomials in the channel coefficients Kn^k, k ^ 2,m j^ 1 
with K — 1 transmitters and K — 1 receivers. By our induction hypothesis, we may invert to obtain all 

1 2 

Wk,g,g G Q£ , working with each (l,2)-factor class in turn. 

However, the choice of k = l,m = 2 plays no special role in the above argument as we have already 
explained, so that we may recover Wk^g for all monomials 

and any choice k,m E {1, 2, ■ ■ ■ , K}. Removing these decoded messages from the received equations, we 
have reduced the monomials to have exponent no higher than L — 2. Hence, we may proceed iteratively, 
reducing the order of L by one in each iteration. Thus, Wk,g can be recovered for all A; G {1, 2, ■ ■ ■ , K}, g e 
Ql, that is, the mapping is invertible over its range. ■ 

VIL Conclusion 

We considered the asymptotic behavior of compute-and-forward over a section of a relay network 
with K transmitters and K relays. We showed that the lattice implementation of compute-and-forward 
proposed by Nazer and Gastpar in [[TTIl achieves at most 2/(1 + 1/Ji) < 2 degrees of freedom. Thus, the 
asymptotic behavior of the lattice scheme is very different from the MIMO upper bound resulting from 
allowing full cooperation among transmitters and among relays and achieving K degrees of freedom. We 
then argued that this gap is not fundamental to the compute-and-forward approach in general, but rather 
due to the lattice implementation in ifTTTl . To this end, we proposed and analyzed a different implementation 
of compute-and-forward and showed that it achieves K degrees of freedom. Thus, at least in terms of 
degrees of freedom, compute-and-forward can achieve the same asymptotic rates as if full cooperation 
among transmitters and among relays were permitted. 
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Appendix A 

Change of Measure in the Proof of Theorem [H 

Here we show that if (|2TI) in Section |V] holds for almost all h G R^ ~^, then it also holds for almost 
all h G R^. In the following discussion, we use the notation fix to denote Lebesgue measure over R^. 
Let B C R^~^ be a set of vectors h G R^^^ of measure zero, i.e., 

fiK-iiB) = 0. 

Let C R^ be the set of vectors h eR^ such that 

^ (/ii ■ ■ ■ hK-i) G B. 
We want to show that D has also measure zero, i.e., /iA'(-D) = 0. We have 

l-iK{D)= [ lD{h)dh 
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Making the change of variables 



hk = ^y for A;G 
s = \\h\\, 



and using the nonnegativity of 1^ together with Fubini's theorem, we can rewrite this as 

Hj,(D)= s^-W tBih){l - \\h\\Y^/^dhds. 

Js=0 Jh&W^-^MhiKl 



Now, 

[ iB{h){i-\\hfy^/^dh 

JheR'<-'^:\\h\\<l 



< lB{h)s~^dh+ {l-\\hfY^'^dh 

Jh&n'i-'^:\\h\\<VT^ Jh<^n'i-^:VT^<\\h\\<l 

< e-'f^K^B) + 2 / (1 - S'r'/'dS 

r((A - l)/2) J,/TZ^<rs<i 



— — (^7r/2 - arcsin(Vl-£2)j 



r((ir-i)/2) 

for every e > 0, and where r(-) denotes the Gamma function. Letting e — )■ 0, we obtain 

ih,eR^-i:||^il<i 

and hence 

/ixP) = 0. 

This shows that (f2T)) holds also for almost every H G R^^^. 
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